Finite size effects on thermal denaturation of globular proteins 
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Finite size effects on the cooperative thermal denaturation of proteins are considered. A dimen- 
sionless measure of cooperativity, Qc, scales as A^*", where N is the number of amino acids. Surpris- 
ingly, we find that is universal with = 1 + 7, where the exponent 7 characterizes the divergence 
of the susceptibility for a self-avoiding walk. Our lattice model simulations and experimental data 
are consistent with the theory. Our finding rationalizes the marginal stability of proteins and sub- 
stantiates the earlier predictions that the efficient folding of two-state proteins requires Tp ~ Te, 
where Te and Tp are the collapse and folding transition temperatures, respectively. 



Single domain globular proteins, which are finite sized 
systems, undergo remarkably cooperative transitions 
from an ensemble of unfolded states to well ordered folded 
(or native) states as the temperature is lowered (Fig. 
1(a)). In many cases, the transition to the native state 
takes place in an apparent two-state manner, i.e., the 
only detectable species are the native (more precisely, the 
conformations belonging to the native basin of attraction 
(NBA)) or unfolded (U) states ^ Although the micro- 
scopic origin of cooperativity is not fully understood^ , the 
transition to the NBA at the folding transition tempera- 
ture, Tp, is a consequence of the effective interresidue at- 
traction that compensates for the entropy loss. From this 
perspective the NBA U transition can be viewed as 
a phase transition in a finite-sized system. Furthermore, 
the transition to the NBA at Tp has the characteristics 
of a first order phase transition^'^. Many experiments 
have shown that folded states of globular proteins are 
only marginally stable below Tp. The free energies of sta- 
bility of the NBA, relative to the U states, vary within 
the range of (5 — 20)A;bT at neutral pH [lb]. Because 
proteins are polymers we expect that they would also 
undergo a collapse transition to a compact phase at the 
temperature Tg suitably modified for finite size systems, 
when water becomes a poor solvent for the polypeptide. 
We have previously shown that for protein sequences that 
fold in an apparent two-state manner Tp « Tg, which 
naturally explains the marginal stability of proteins^. 

The quest to understand, at the molecular level, the 
cooperative U ^ NBA transition has lead to a number 
of computational studies [2b, 4, 5]. Although considerable 
effort has been directed to describe the molecular basis 
of cooperativity, somewhat surprisingly, examination of 
the finite size effects in the self-assembly of proteins has 
received little attention^. In contrast, scaling theories for 
finite sized systems undergoing regular first and second 
order phase transitions have been fully developed^. The 
purpose of this paper is to study the effect of N, the 
number of amino acid residues in a protein, on the extent 
of cooperativity in the U ^ NBA transition. 

Thermal denaturation data of wild-type ( WT) proteins 
and lattice models (LMs) of polypeptide chains are used 
to examine the dependence of the cooperativity on N. 



We show that a dimensionless measure of cooperativitji^ 
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where /^r is a measure of occupation of NBA, AT is the 
full width at half- maximum of df^/dT, and Tp is the 
folding transition temperature identified with the maxi- 
mum in df^/dT. We find that 



C - 1 + 7 



(3) 



where 7 is the exponent that characterizes the divergence 
of susceptibility at the critical point for a n-component 
ferromagnet with n = 0, i.e., for a self-avoiding walk. As 
a byproduct of this study we also show that ^ 
The parameter flf. is a convolution of the sharpness of the 
transition (Tp/AT) and the extent to which structure, 
as measured by /a? , changes around Tp . For infinite sys- 
tems undergoing sharp transitions, 00, whereas Vic 
is small for broad or highly rounded phase transitions^. 
The relationship given in Eq. Q, which can only be 
valid near Tg, establishes the proposal that Tp w Tg for 
two-state foldersM. 

To establish the results given above we used thermal 
and chemical denaturation data together with the LMs 
of a polypeptide chain to compute the growth of flc with 
N. In the LM each amino acid is represented as a sin- 
gle bead confined to the vertices of a cubic lattice [2c]. 
The energy of a conformation specified by the positions, 
{ri}{i = l,2,...,iV), is E{fi} = Y,i<j ^ij^r,i,a, where 
a is the lattice spacing, Vij = [r^ — rj\, 5x,a is the Kro- 
necker delta function. The contact energies = — 1, 
if the interaction between beads i and j in a given con- 
formation is also present in the native state (i.e., the 
lowest energy conformation for a given sequence), and is 
zero, otherwise. Even though simple LMs do not quan- 
titatively capture the cooperativity of folding transitions 
in proteins?:, they are useful for obtaining global folding 
properties. The precise choice of Cij should not affect the 
predicted universal scaling of fie with N . Our purpose 



2 



in undertaking LM Monte Carlo (MC) simulations is to 
show that Eqs. Q"® should be valid for any model of 
proteins that exhibits a cooperative U ^ NBA transi- 
tion. 

To calculate ilc for LMs we employ the temperature 
dependence of the overlap function'^ 
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where r% is the distance between beads i and j in the 
native conformation. The overlap function x is an order 
parameter that distinguishes the NBA and U states. 
The folding transition temperature Tp can be estimated 
from the location of the maximum in d < x > /dT, 
where < ... > indicates a thermal average. For LMs 
< X >« 1 — /n^- Therefore, Eq. Q may be evaluated 
using 



nr. = 



AT 



d<X> 
dT 



(5) 



T=Tp 



Analysis of experimental and simulation data: To es- 
tablish the results given above we first analyzed thermal 
denaturation data for WT proteins. As an example we 
show in Fig. [Hthe plot of /Ar(T) and d/Ar(r) /dT for villin 
{N = 35) and ADA 2h {N = 80>i^. In accord with Eq. 
^ we find that the thermal denaturation of ADA 2h is 
more cooperative than that of villin headpiece. 

From Fig. 2 we find that from thermal denatu- 
ration data for 32 WT proteinsi^, scales as iV"'*' with 
A = 1.08 ± 0.04. Given that the data for these pro- 
teins are obtained under varying experimental conditions 
and using different methods for computing the enthalpy 
and entropy changes at Tp, the agreement between the 
predicted and observed behavior is excellent. For LMs 
AT/Tp - N'^ with A = 1.14 ±0.06 (Fig. 2). The smaU 
deviation of A from unity in LMs is, in all likelihood, 
due to the simplicity of the a-carbon representation of 
the polypeptide chain that does not capture the crucial 
role of side chains. Inclusion of side chains, which are 
tightly packed in native conformations, is expected to 
reduce fluctuations. Moreover, for iV < 40 most of the 
beads are on the surface, which also leads to considerable 
conformational fluctuations. Therefore, the expected re- 
lation ^ ~ holds nearly quantitatively. 

The dependence of fie on N for WT proteins and LMs 
shows that f2c ~ N'^ (Fig- (3)). From the linear fit to 
the log-log plot of the data we find ( « 2.17 ± 0.09 for 
WT proteins and w 2.33 ± 0.08 for LMs. The 5th order 
e expansion for polymers using n-component theory 
with n = gives 7 = 1.23i&. Thus, from Eq. ^ we 
predict that ( w 2.22. Thus, the data for WT proteins 
and LMs are consistent with the theoretical prediction 
(Eq. 0). We should emphasize that the robustness of 
the fit has been checked using different fitting procedures. 
The remarkable finding relating the critical exponent 7 to 
thermal denaturation of proteins gives further credence 
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FIG. 1: (a) Temperature dependence of the fraction of occu- 
pation of the native state, /jv(r), and its derivative d/jv/dT. 
The dotted lines are for villin head piece and the solid lines 
show the data for ADA 2h— . Temperature is measured in 
Centigrades. (b) Dependence of l-< x > d < x> /dT on 
temperature for LMs. We calculate AT using d < x > /dT. 
The dotted lines are for the sequence with A'^ = 48 and the 
solid lines correspond to A = 80. 



to the proposal that efficient folding is achieved at Tp « 
Tg^. It also suggests that U ^ NBA transition is only 
weakly first order, thus explaining the marginal stability 
of globular proteins. 

Most folding experiments are performed by titrating 
with denaturants (urea or guanidine hydrochloride). At 
denaturant concentrations above the midpoint Cm (at 
which the populations of the folded and unfolded states 
are equal) proteins are denaturated. Thus, phase transi- 
tions to the NBA occur by varying denaturant concen- 
tration. In analogy with Eq. we computed, for 33 WT 



proteins^'', Uc = ^|^|c=c„ and where AC is 
the full width at half-maximum of dfM/dC. The plots of 
lnA£ and InVLc as a function of InA yield A « 1.22 ± 0.14 
and C « 2.45 ± 0.29, respectively (see the insets to Figs. 
(2,3)). Thus, the scaling of Vic and AC /Cm remains es- 
sentially unchanged even though the chemical and ther- 
mal denaturation mechanisms are vastly different. This 
result also suggests that ( is universal. However, the 
dependence of Infic on InA has a correlation coefficient 
of only about 0.6 compared to 0.95 for thermal denatu- 
ration. We believe that larger uncertainties result from 
greater drift in the experimental signals in denaturant- 
induced unfolding compared to thermal denaturation^. 
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FIG. 2: The sharpness of the folding transition AT/Tf as 
a function of A*'. Open circles represent the results from LM 
simulations with the corresponding fit (dotted line) AT/Tf ~ 
N'^ with A = 1.14 ± 0.08. The linear fit (solid line) to the 
experimental data for 32 WT proteins (solid circles) gives 
A = 1.08 ± 0.04. The correlation coefficient for In AT/Tf 
and In A*' is 0.95. For clarity LM data are shifted up by 0.4. 
Inset shows the dependence of the width of folding transition 
InAC/Cm for chemical denaturation on In TV. The linear fit 
to the data points collected for 33 WT proteins yields A = 
1.22 ± 0.14 (the correlation factor is 0.59). 
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FIG. 3: Plot of Infic as a function of IniV. Symbols are 
the same as in Fig. (2). The dotted line is a fit to the LM 
data, which gives ( = 2.33 ±0.08. The solid line is a fit to the 
experimental values of Qg^^- with the exponent ^ = 2.17±0.09. 
The correlation coefficient for lnf2c and In A'" is 0.95. The LM 
data are shifted down by 0.7. The dependence of the folding 
cooperativity on N for chemical denaturation is plotted in the 
inset. The linear fit to experimental data (solid line) results 
in the exponent ^ — 2.45±0.29 (the correlation factor is 0.59). 
Both sets of experimental data rule out C = 2. 



The rationale for Eqs. (2,3) is based on the follow- 
ing arguments. (1) By analogy with magnetic systems 
Ax is similar to susceptibility and should be given by 
Ax — Td < X > /9h, where ft. is a "magnetic" or an 
ordering field conjugate to x- Because Ax is dimen- 
sionless, we expect that the ordering field h ^ T and 
thus Td < X > / dT in proteins is similar to magnetic 
susceptibility. (2) Camacho and Thirumalai.2, have sug- 
gested that efficient folding in apparent two-state folders 
requires Tp f^Tg. Because the transition at Tg is usually 
second order—, while the one at Tp is first order [2c, 11], 
the Tp ~ Tg condition implies that folding of two-state 
globular proteins occurs near a tricritical poin1»^. There- 
fore, the critical exponents that control the behavior of 
the polypeptide chain at Tg should manifest itself in the 
U <-> NBA phase transition. Using these arguments we 
can obtain the N dependence of flc in the following way. 
In general, we expect that close to T Tg ^ Tp the 
Flory radius^^ Rp ~ AT""" N" {Rp is the analogue of 
the correlation length in magnetic systems). This implies 
that AT/Tp ~ N~^. Because of the analogy to magnetic 
susceptibility, we expect Td < X > /dT ^ iV". Using 
Eq. (O we obtain the expected relationship ilc ^ iV^+''', 
which directly follows from the hypothesis that Tp w Tg 
for efficient two-state folders^. 

The scaling Vic ^ N'' with C clearly different from 2 
may appear to be at odds with the idea that the struc- 
tures in the NBA are sequence-specific. However, the 
global characteristics embodied in the growth of flc with 
N are valid only at T k, Tp. In the neighborhood of 
this temperature the general characteristics of the U ^ 
NBA transition are governed by the properties of the 
unfolded states as Tp is approached from above. It has 
been shown that in the denaturated states (T > Tp) 
the global properties like the gyration radius Rg ^ N" 
with V w 0.59 as expected for homopolymerc^'^ . Similarly, 
the homopolymeric nature around Tp is reflected in the 
growth of Vic with N . 

The finding that the folding transition at Tp occurs at 
a tricritical point suggests that the native states of natu- 
ral proteins are only marginally stable. Because biologi- 
cal functions require transitions between different states, 
it is logical to postulate that natural foldable proteins 
have evolved to ensure Tg k, Tp. The coil-globular tran- 
sition at Tg is likely to be a second order transition in- 
volving no discontinuity in the free energy. At Tp the 
transition is of the first order. The closeness of Tp and 
Tg implies that the discontinuity of the free energy at Tp 
cannot be large. As a result the folded state is expected 
to be only marginally stable with respect to the ensem- 
ble of denatured states. As argued elsewhere^ this con- 
dition is also equivalent to maximizing the ratio Tp/Tg, 
where Tg is a glass transition temperature^^. Marginal- 
ity condition may also be a requirement for robustness 
of the folded state. This may explain why small single 
domain proteins can tolerate a large number of muta- 
tions without substantial changes in the native state. It 
is also likely, as recently shown, that evolution has led to 
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marginally stable proteins that have maximum sequence- 
structure compatibilitjii2i22i. 
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